Skip to content

Conversation

@MementoRC
Copy link
Owner

Summary

Integrates enhanced semantic search capabilities from the feature/task-3-enhanced-semantic-search worktree branch.

Feature Files Added

  • src/uckn/core/semantic_search_enhanced.py - EnhancedSemanticSearchEngine with SentenceTransformer + ChromaDB
  • src/uckn/core/atoms/multi_modal_embeddings.py - Multi-modal embedding generation for code/text/config/error
  • src/uckn/core/atoms/query_parser.py - Boolean query parsing with stemming and synonyms
  • src/uckn/core/atoms/search_suggestion_engine.py - Autocomplete and history tracking
  • src/uckn/mcp/tools.py - Extracted MCP tool implementations

Tests Added

  • tests/test_semantic_search_enhanced.py
  • tests/unit/atoms/test_multi_modal_embeddings.py

Quality Fixes Applied

  • Fixed unused imports in test files (F401)
  • Updated ML dependency detection to use importlib.util.find_spec pattern
  • Formatted all files with ruff

Notes

  • This PR intentionally excludes ~100+ CI fix commits from July 9-14, 2025
  • Development branch has newer CI+pixi work that supersedes those commits
  • Optional dependencies (sentence-transformers, transformers, chromadb, nltk) have graceful fallbacks

Test Plan

  • CI passes lint, format, typecheck
  • Unit tests pass
  • Integration tests pass
  • Enhanced semantic search functionality works with optional ML dependencies

🤖 Generated with Claude Code

MementoRC and others added 3 commits January 4, 2026 15:10
Adds enhanced semantic search capabilities from feature/task-3-enhanced-semantic-search:

Feature files:
- EnhancedSemanticSearchEngine with SentenceTransformer + ChromaDB integration
- MultiModalEmbeddings for code/text/config/error embeddings
- QueryParser with boolean operators, stemming, and synonyms
- SearchSuggestionEngine with autocomplete and history tracking
- Extracted MCP tool implementations

Tests:
- test_semantic_search_enhanced.py
- test_multi_modal_embeddings.py (updated to use importlib.util.find_spec)

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
- Fix SemanticSearchEngine patch path (uckn.core.semantic_search)
- Fix chromadb availability mock to use ML manager capabilities
- Convert list metadata to comma-separated strings per schema
- Use numpy.testing for embedding comparison
- Remove flaky timestamp auto-update assertion

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
- Fix MultiModalEmbeddings import path (atoms subdirectory)
- Add explicit type annotation for device attribute
- Add type ignore comment for optional import assignment

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude Opus 4.5 <[email protected]>
@MementoRC MementoRC merged commit 37137ef into development Jan 4, 2026
12 checks passed
@MementoRC MementoRC deleted the integrate/task-3-semantic-search branch January 4, 2026 23:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants